Goto

Collaborating Authors

 metabolomic data


Combining low-dose CT-based radiomics and metabolomics for early lung cancer screening support

Zyla, Joanna, Marczyk, Michal, Prazuch, Wojciech, Socha, Marek, Suwalska, Aleksandra, Durawa, Agata, Jelitto-Gorska, Malgorzata, Dziadziuszko, Katarzyna, Szurowska, Edyta, Rzyman, Witold, Widlak, Piotr, Polanska, Joanna

arXiv.org Artificial Intelligence

Due to its predominantly asymptomatic or mildly symptomatic progression, lung cancer is often diagnosed in advanced stages, resulting in poorer survival rates for patients. As with other cancers, early detection significantly improves the chances of successful treatment. Early diagnosis can be facilitated through screening programs designed to detect lung tissue tumors when they are still small, typically around 3mm in size. However, the analysis of extensive screening program data is hampered by limited access to medical experts. In this study, we developed a procedure for identifying potential malignant neoplastic lesions within lung parenchyma. The system leverages machine learning (ML) techniques applied to two types of measurements: low-dose Computed Tomography-based radiomics and metabolomics. Using data from two Polish screening programs, two ML algorithms were tested, along with various integration methods, to create a final model that combines both modalities to support lung cancer screening.


Interpretable machine learning on metabolomics data reveals biomarkers for Parkinson's disease

#artificialintelligence

The use of machine learning (ML) with metabolomics provides opportunities for the early diagnosis of disease. However, the accuracy and extent of information obtained from ML and metabolomics can be limited owing to challenges associated with interpreting disease prediction models and analysing many chemical features with abundances that are correlated and'noisy'. Here, we report an interpretable neural network (NN) framework to accurately predict disease and identify significant biomarkers using whole metabolomics datasets without feature selection. The performance of the NN approach for predicting Parkinson's disease (PD) from blood plasma metabolomics data was significantly higher than classical ML methods with a mean area under the curve of 0.995. PD-specific markers that contribute significantly to early disease prediction were identified including an exogenous polyfluoroalkyl substance. It is anticipated that this accurate and interpretable NN-based approach can improve diagnostic performance for many other diseases using metabolomics and other untargeted'omics methods.


A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification

#artificialintelligence

The multidisciplinary field of data science is concerned with extracting insights from data using a diverse set of computational methodologies, theories, and technologies (Blei and Smyth 2017). Within data science, there are two competing scientific philosophies: classical statistics and machine learning (Breiman 2001b). Classical statistics aims to formalise relationships between dependent and independent variables based on a clearly defined set of assumptions from which mathematical models are parametrised. The aim is to derive meaningful statistical inference (properties of an underlying probability distribution) for the measured variables, assuming that the observed data is sampled from a larger population. Conversely, machine learning uses ad-hoc computational algorithms that iteratively optimise (or'learn') without necessarily relying on any formal statistical assumptions (Bishop 1995).


Genome Medicine

#artificialintelligence

Breast cancer is the most frequently diagnosed cancer in women worldwide excluding skin cancer and it is ranked second for deaths among cancer patients [1]. Early diagnosis of breast cancer is crucial for patient prognosis. Currently, however, clinically diagnosed breast tumors have a median size of 2 to 2.5 cm [2], which are likely to be later stage (stage III) breast tumors that have already metastasized to axillary lymph nodes. A highly accurate diagnostic test for breast cancer is currently lacking. The standard mammography test has sensitivities of merely 54 to 77 % [3].